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Abstract 

This is a partial account of the fascinating history of Distance Geometry. We make no claim 
to completeness, but we do promise a dazzling display of beautiful, elementary mathematics. We 
prove Heron’s formula, Cauchy’s theorem on the rigidity of polyhedra, Cayley’s generalization of 
Heron’s formula to higher dimensions. Monger’s characterization of abstract semi-metric spaces, a 
result of Godel on metric spaces on the sphere, and Schoenberg’s equivalence of distance and positive 
semidefinite matrices, which is at the basis of Multidimensional Scaling. 

Keywords: Euler’s conjecture, Cayley-Menger determinants, Multidimensional scaling, Euclidean 
Distance Matrix 


1 Introduction 


Distance Geometry (DG) is the study of geometry with the basic entity being distance (instead of lines, 
planes, circles, polyhedra, conics, surfaces and varieties). As with most everything else, it all began with 
the Greeks, specifically Heron, or Hero, of Alexandria sometime between 150BG and 250AD, who showed 
how to compute the area of a triangle given its side lengths [36] . 

After a hiatus of almost two thousand years, we reach Arthur Gayley’s: the first paper of volume I of 
his Collected Papers, dated 1841, is about the relationships between the distances of five points in space 
[7]. The gist of what he showed is that a tetrahedron can only exist in a plane if it is flat (in fact, he 
discussed the situation in one more dimension). This yields algebraic relations on the side lengths of the 
tetrahedron. 

Hilbert’s influence on foundations and axiomatization was very strong in the 1930s Mitteleuropa m- 
This pushed many people towards axiomatizing existing mathematical theories m- Karl Menger, a young 
professor of geometry at the University of Vienna and an attendee of the Vienna Gircle, proposed in 1928 
a new axiomatization of metric spaces using the concept of distance and the relation of congruence, and, 
using an extension of Gayley’s algebraic machinery (which is now known as Cayley-Menger determinant), 
generalized Heron’s theorem to compute the volume of arbitrary K-dimensional simplices using their side 
lengths p?T] . 

The Vienna Circle was a group of philosophers and mathematicians which convened in Vienna’s 
Reichsrat cafe around the nineteen-thirties to discuss philosophy, mathematics and, presumably, drink 
coffee. When the meetings became excessively politicized, Menger distanced himself from it, and organized 
instead a seminar series, which ran from 1929 to 1937 [34]. A notable name crops up in the intersection 
of Monger’s geometry students, the Vienna Circle participants, and the speakers at Monger’s Kolloquium: 
Kurt Godel. Most of the papers Godel published in the Kolloquium’s proceedings are about logic and 
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foundations0 but two, dated 1933, are about the geometry of distances on spheres and surfaces. The 
first [34l 18 Feb. 1932, p. 198] answers a question posed at a previous seminar by Laura Klanfer, and 
shows that a set X of four points in any metric space, congruent to four non-coplanar points in can 
be realized on the surface of a three-dimensional sphere using geodesic distances. The second [33] 17 
May 1933, p. 252] shows that Cayley’s relationship hold locally on certain surfaces which behave locally 
like Euclidean spaces. 

The pace quickens: in 1935, Isaac Schoenberg published some remarks on a paper [33] by Frechet 
on the Annals of Mathematics, and gave, among other things, an algebraic proof of equivalence between 
Euclidean Distance Matrices (EDM) and Gram matrices. This is almost the same proof which is nowadays 
given to show the validity of the classical Multidimensional Scaling (MDS) technique [5] § 12.1]. 

This brings us to the computer era, where the historical account ends and the contemporary treatment 
begins. Computers allow the efficient treatment of masses of data, some of which are incomplete and 
noisy. Many of these data concern, or can be reduced to, distances, and DC techniques are the subject 
of an application-oriented renaissance [IZl [35]. Motivated by the Global Positioning System (GPS), 
for example, the old geographical concept of trilateration (a system for computing the position of a 
point given its distances from three known points) makes its way into DC in wireless sensor networks 
[12] . Wiithrich’s Nobel Prize for using Nuclear Magnetic Resonance (NMR) techniques in the study of 
proteins brings DC to the forefront of structural bioinformatics research [^. The massive use of robotics 
in mechanical production lines requires mathematical methods based on DC [38j . 

DC is also tightly connected with graph rigidity [^. This is an abstract mathematical formulation of 
statics, the study of structures under the action of balanced forces [30], which is at the basis of architecture 
[33]. Rigidity of polyhedra gave rise to a conjecture of Euler’s [T3] about closed polyhedral surfaces, which 
was proved correct only for some polyhedra: strictly convex [B], convex and higher-dimensional [3], and 
genericl^ [17j . It was however disproved in general by means of a very special, non-generic nonconvex 
polyhedron [8]. 

The rest of this paper will focus on the following results, listed here in chronological order: Heron’s 
theorem (Sect.[2l), Euler’s conjecture and Cauchy’s proof for strictly convex polyhedra (Sect.|31), Cayley- 
Menger determinants (Sect. [4]), Monger’s axiomatization of geometry by means of distances (Sect. [5]), a 
result by Godel’s concerning DG on the sphere (Sect.lB]), and Schoenberg’s equivalence (Sect. [7]) between 
EDM and Positive Semidefinite Matrices (PSD). There are many more results in DG: this is simply our 
own choice in terms of importance and beauty. 


2 Heron’s formula 


Heron’s formula, which is usually taught at school, relates the area M of a triangle to the length of its 
sides a, b, c and its semiperimeter s = as follows: 

A = \/ s{s — a){s — b){s — c). (1) 

^The first public mention of Godel’s completeness theorem m (which was also the subject of his Ph.D. thesis) was 
given at the Kolloquium |341 14 May 1930, p. 135], just three months after obtaining his doctorate from the University 
of Vienna. As for his incompleteness theorem |19| . F. Alt recalls |341 Afterword] that Godel’s seminar |34l 22 Jan. 1931, 
p. 168] appears to have been the first oral presentation of its proof: 

There was the unforgettable quiet after Godel’s presentation, ended by what must be the understatement of 
the century: “That is very interesting. You should publish that.” Then a question: “You use Peano’s system 
of axioms. Will it work for other systems?” Godel, after a few seconds of thought: “Yes, any system broad 
enough to define the field of integers.” Olga Taussky (half-smiling): “The integers do not constitute a field!” 

Godel, who knew this as well as anyone, and had only spoken carelessly: “Well, the. . . the. .. the domain of 
integrity of the integers.” And final relaxing laughter. 

The incompleteness theorem was first mentioned by Godel during a meeting in Konigsberg, in Sept. 1930. Menger, who 
was travelling, had been notified immediately: with John Von Neumann, he was one of the first to realize the importance 
of Godel’s result, and began lecturing about it immediately |34l Biographical introduction]. 

^A polyhedron is generic if no algebraic relations on Q hold on the components of the vectors which represent its vertices. 
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There are many ways to prove its validity. Shannon Umberger, a student of the “Foundations of Geometry 
I” course given at the University of Georgia in the fall of 2000, proposes, as part of his final project^ 
three detailed proofs: an algebraic one, a geometric one, and a trigonometric one. John Conway and 
Peter Doyle discuss Heron’s formula proofs in a publically available email exchangc0 from 1997 to 2001. 

Our favourite proof is based on complex numbers, and was submittecjfl to the “Art of Problem Solving” 
online school for gifted mathematics students by Miles Edward^ when he was studying at Lassiter High 
School in Marietta, Georgia. 

2.1 Theorem (Heron’s formula |36| 1 

Let A be the area of a triangle with side lengths a, b, c and semiperimeter length s = ^{a + b + c). Then 
A = y/ s{s — a){s — b){s — c). 

Proof. m Consider a triangle with sides a,b,c (opposite to the vertices A,B,C respectively) and its 
inscribed circle centered at O with radius r. The perpendiculars from O to the triangle sides split a into 
y,z, b into x,z and c into x,y as shown in Fig. [T] Let u,v,w be the segments joining O with A,B,C, 
respectively. First, we note that 2a + 2/3 + 27 = 27r, which implies a + /3 + 7 = tt. Next, the following 



Figure 1: Heron’s formula: a proof using complex numbers. 

complex identities are easy to verify geometrically in Fig. [I] 

r + ix = 
r + iy = ve’’^ 

r + iz = 

These imply: 

(r + ix){r + iy)(r + iz) = = uvwe™ = —uvw, 

" http: //j Wilson. coe. uga. edu/emt668/einat6680.2000/uinberger/MATH7200/HeronForinulaPro j set/f inalpro j ect. html 
https: //math. dartmouth. edu/~ doyle/docs/heron/heron. txt 
http: //www. artof problemsolving. com/Resources/Papers/Heron. pdf 
^Also see http: //newsinf o. iu.edu/news/page/normal/13885.html and http: //www. jstor. org/stable/10.4169/amer .math, 
for more recent career achievements of this gifted student. 
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where the last step uses Euler’s identity 6*’’’ + ! = 0 [IHl I-VIII, § I38-I40, p. 148]. Since —uvw is real, the 
imaginary part of (r + ix)(r + iy)(r + iz) must be zero. Expanding the product and rearranging terms, 
we get r‘^{x + y + z) = xyz. Solving for r, we have the nonnegative root 


r=J ■ ( 2 ) 

y X + y + z 

We can write the semiperimeter of the triangle ABC as s = ^{a+b+c) = ^{y+z+x+z+x+y) = x+y+z. 
Moreover, 


s—a = x+y+z—y—z=x 
s—b = x+y+z—x—z=y 
s — c = x + y + z —x — y = z, 

so xyz = (s — a) (s — b)(s — c), which implies that Eq. ([2|) becomes: 


(s — a){s — b){s — c) 


We now write the area A of the triangle ABC by summing it over the areas of the three triangles AOB, 
BOC, CO A, which yields: 

. 1, , . CL + b + C t —-r-T-TT-T-r 

A = 2 V ® + rb + rc) = r - - - = rs = y s{s — a)[s — b)[s — c). 


as claimed. 


□ 


3 Euler’s conjecture and the rigidity of polyhedra 


Consider a square with unit sides, in the plane. One can shrink two opposite angles and correspondingly 
widen the other two to obtains a rhombus (see Fig. ID), which has the same side lengths but a different 
shape: no sequence of rotations, translations or reflections can turn one into the other. In other words, 
a square is flexible. By contrast, a triangle is not flexible, or rigid. 



Figure 2: A square is flexed into a rhombus. The set of faces (the edges) are the same, and each maintains 
pairwise distances through the flexing, i.e. two points on the same edge have the same distance on the 
left as on the right figure. 


Euler conjectured in 1766 m that all three-dimensional polyhedra are rigid. The conjecture appears 
at the end of the discussion about the problem Invenire duas superficies, quarum alteram in alteram 
transformare liceat, ita ut in utraque singula puncta homologa easdem inter se teneat distantias, i.e.: 

To find two surfaces for which it is possible to transform one into the other, in such a way 
that corresponding points on either keep the same pairwise distance, (f) 
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Towards the end of the paper, Euler writes Statim enim atque figura undique est clausa, nullam amplius 
mutationem patitur, which means “As soon as the shape is everywhere closed, it can no longer be trans¬ 
formed” . Although the wording appears ambiguous by today’s standards, scholars of Euler and rigidity 
agree: what Euler really meant is that 3D polyhedra are rigid m- 

To better understand this statement, we borrow from [1] the precise definition of a polyhedral^ a 
family fC of points, open segments and open triangles is a triangulation if (a) no two elements of fC have 
common points, and (b) all sides and vertices of the closure of any triangle of /C, and both extreme 
points of the closure of any segment of K. are all in /C themselves. Given a triangulation /C in (where 
K € {1,2,3}), the union of all points of /C with all points in the segments and triangles of K. is called 
a polyhedron. Note that several triangular faces can belong to the same affine space, thereby forming 
polygonal faces. 

Each polyhedron has an incidence structure of points on segments and segments on polygonal (not 
necessarily triangular) faces, which induces a partial order (p.o.) based on set inclusion. Eor example, 
the closure of the square ABCD contains the closures of the segments AB, BC, CD, DA, each of which 
contains the corresponding adjacent points A, B, B, C, C, D, D, A. Accordingly, the p.o. is A C AB, DA, 
B c AB,BC-, C C BC,CD-, D c CD, DA] AB,BC,CD,DA c ABCD. Since this p.o. also has a 
bottom element (the empty set) and a top element (the whole polyhedron), it is a lattice. A lattice 
isomorphism is a bijective mapping between two lattices which preserves the p.o. Two polyhedra P, Q 
are eombinatorially equivalent if their triangulations are lattice isomorphic. If, moreover, all the lattice 
isomorphic polygonal faces of P, Q are exactly equal, the polyhedra are said to be facewise equal. 

Under the above definition, nothing prevents a polyhedron from being nonconvex (see Fig. [3|). It 
is known that every closed surface, independently of the convexity of its interior, is homeomorphic (in¬ 
tuitively: smoothly deformable in) to some polyhedron (again [TJ § 2.2]). This is why we can replace 
“surface” with “polyhedra”. 



Figure 3: A nonconvex polyhedron. 


The “rigidity” implicit in Euler’s conjecture should be taken to mean that no point of the polyhedron 
can undergo a continuous motion under the constraint that the shape be the same at each point of the 
motion. As for the concept of “shape”, it is linked to that of distance, as appears clear from (f). The 
following is therefore a formal restatement of Euler’s conjecture: two eombinatorially equivalent facewise 
equal polyhedra must be isometric under the Euclidean distance, i.e. each pair of points in one polyhedron 
is equidistant with the corresponding pair in the other. 

A natural question about the Euler conjecture stems from generalizing the example in Fig. [2] to 3D 
(see Fig. HI). Does this not disprove the conjecture? The answer is no: all the polygonal faces in the cube 

^This definition is different from the usual definition employed in convex analysis, i.e. that a polyhedron is an intersection 
of half-spaces; however, a convex polyhedron in the sense given here is the same as a pol 3 d.ope in the sense of convex analysis. 
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Figure 4: A cube can be transformed into a rhomboid, but the set of faces is not the same anymore 
(accordingly, corresponding point pairs may not preserve their distance, as shown). 


are squares, but this does not hold in the rhomboid. The question is more complicated than it looks at 
first sight, which is why it took 211 years to disprove it. 


3.1 Strictly convex polyhedra: Cauchy’s proof 

Although Euler’s conjecture is false in general, it is true for many important subclasses of polyhedra. 
Cauchy proved it true for strictly convex polyhedral There are many accounts of Cauchy’s proof: 
Cauchy’s original text, still readable today [B]; Alexandrov’s book [2], Lyusternik’s book [551 § 20], 
Stoker’s paper [IS], Connelly’s chapter just to name a few. Here we follow the treatment given by Pak 

m- 

We consider two combinatorially equivalent, facewise equal strictly convex polyhedra P, Q, and aim 
to show that P and Q are isometric. 

For a polyhedron P we consider its associated graph G{P) = {V, E), where V are the points of P and 
E its segments. Note that G{P) only depends on the incidence structure of the polygonal faces, segments 
and points of P. Since P,Q are combinatorially equivalent, G{P) = G{Q). Consider the dihedral angled 
ctuvjPuv on P,Q induced by the segment represented by the edge {u,!;} G E. We assign to each edge 
{u,v} G E a. label £uv = sgn{l3uv — ctuv) (so £uv G {—1,0,1}), and consider, for each v G V, the edge 
sequence dy = ({u,u} | u G N{v)), where N{v) is the set of nodes adjacent to v. The order of the edges 
in dy is given by any circuit around the polygon p{v) obtained by intersecting P with a plane 7 which 
separates v from the other vertices in V (this is possible by strict convexity, see Fig. |S|). It is easy to 
see that every edge {u, u} G dy corresponds to a vertex of p{v). Therefore, a circuit over p[v) defines an 
order over dy. We also assume that this order is periodic, i.e. its last element precedes the first one. Any 
such sequence dy naturally induces a sign sequence sy = {£uv \ {u, v} G dy); we let Sy be the sequence Sy 
without the zeros, and we count the number ruy of sign changes in Sy, including the sign change occurring 
between the last and first elements. 

3.1 Lemma 

Eor all V gV, my is even. 


Proof. Suppose is odd, and proceed by induction on my', if my = 1, then there is only one sign 
change. So, the first edge {m,u} in dy to be labelled with f^y ^ 0 has the property that, going around 
the periodic sequence with only one sign change, (u, v} is also labelled with —£uv, which yields +1 = — 1 , 
a contradiction. A trivial induction step yields the same contradiction for all odd my. □ 


We now state a fundamental technical lemma, and provide what is essentially Cauchy’s proof, rephrased 

®In fact Cauchy’s proof contained two mistakes, corrected by Steinitz 1281 p. 67] and Lebesgue. 

®Two half-planes in K® intersecting on a line L define an angle smaller than tt called the dihedral angle at L. 















3 EULER’S CONJECTURE AND THE RICIDITY OE POLYHEDRA 


7 
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Figure 5: The plane 7 separating v from the other vertices in the strictly convex polyhedron P, and 
the intersection polygon p(v) dehned by wi,..., W4. The line L (lying in p(v)) separates the +1 and —1 
labels applied to the points wi,... ,104 of intersections between the edges of P and p{v). 


as in [28l Lemma 2 in § 20]. 

3.2 Lemma 

If P is strictly convex, then for each v G V we have either ruy = 0 or niy > 4 . 


Proof By Lemma [3Tj for each v € V we have niy ^ {1, 3}, so we aim to show that ruy ^ 2. Suppose, 
to get a contradiction, that niy = 2, and consider the polygon p{v) as in Fig. [S] By the correspondence 
between edges in ay and vertices of p(y), the labels fyy are vertex labels in p{v). Since there are only 
two sign changes, the sequence of vertex labels can be partitioned in two contiguous sets of +1 and — 1 
(possibly interspersed by zeros). By convexity, there exists a line L separating the +1 and the —1 vertices 
(see Fig.J^. Since all of the angles marked +1 strictly increase, the segment L = L r\p{v) also strictly 
increaseqij; but, at the same time, all of the angles marked —1 strictly decrease, so the segment L also 
strictly decreases, which means that the same segment L both strictly increases and decreases, which is 
a contradiction (see Fig. [51). □ 



Figure 6 : Visual representation of the contradiction in the proof of LemmaThe angles at all vertices 
labelled +1 increase their magnitude, and those at —1 decrease: it follows that L both increases and 
decreases its length, a contradiction. 


^*^This statement was also proved in Cauchy’s paper [^, but this proof contained a serious flaw, later corrected by Steinitz. 
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3.3 Theorem (Cauchy’s Theorem [6]^ 

If two closed convex polyhedra P, Q are combinatorially equivalent and facewise equal, they are isometric. 

We only present the proof of the base case where 

Vv € V {my >0) V 'iv €V {my = 0) (3) 

and G{P) = G{Q) is a connected graph, and refer the reader to [371 P- 251] for the other cases (which 

are mostly variations of the ideas given in the proof below). 

Proof If my = 0 for all v &V, \i means that all of the dihedral angles in P are equals to those of Q, 
which implies isometry. So we assume the alternative w.r.t. Eq. above: \/v € V {my > 0), and aim for 
a contradiction. Let M = Lemma 13^ and because my > 0 for each v, we have M > 4\V\, 

a lower bound for M. We now construct a contradicting upper bound for M. For every h > 3, we let Fh 
be the number of polygonal faces of P with h sides (or edges). The total number of polygonal faces in 
P {or Q) is P = and the total number of edges is therefore S = (we divide by 2 since 

each edge is counted twice in the sum — one per adjacent face — given that P, Q are closed). A simple 
term by term comparison of F and £ yields 4£ — 4T = Since each polygonal face / of 

P is itself closed, the number c/ of sign changes of the quantities iyy over all edges {u,v} adjacent to 
the face / is even, by the same argument given in Lemma 13.11 It follows that if the number h of edges 
adjacent to the face / is even, then c/ < h, and Cf < h — 1 ii h is odd. This allows us to compute an 
upper bound on M: 

M < 2 E 3 + 41^4 + 41^5 + 6^6 + 6 EV + SEg + ... 

< 2E3 + 4E4 + 6E5 + SEg + IOE7 + 12 E 8 + ... 

< 4£-4P = 4\V\ - 8. 

The middle step follows by simply increasing each coefficient. The last step is based on Euler’s charac¬ 
teristic [13]: \V\ + P — £ = 2. Hence we have 4|E| < M < 4|E| — 8 , which is a contradiction. □ 


3.2 Euler was wrong: Connelly’s counterexample 

Proofs behind counterexamples can rarely be termed “beautiful” since they usually lack generality (as 
they are applied to one particular example). Counterexamples can nonetheless be dazzling by themselves. 
Connelly’s counterexample [ 8 ] to the Euler’s conjecture consists in a very special non-generic nonconvex 
polyhedron which flexes, while keeping combinatorial equivalence and facewise equality with all poly¬ 
hedra in the flex. Some years later, Klaus Steffen produced a much simpler polyhedron with the same 
pronertieJ^. It is this polyhedron we exhibit in Fig. [7| 


4 Cayley-Menger determinants and the simplex volume 


The foundation of modern DC, as investigated by Menger [33] and Blumenthal [3], rests on the fact that: 

the four-dimensional volume of a four-dimensional simplex embedded in three dimensional 
spaee is zero, (+) 

which we could also informally state as “flat simplices have zero volume”. This is related to DC because 
the volume of a simplex can be expressed in terms of the lengths of the simplex sides, which yields a 

^^See http://demonstrations.wolfram.com/SteffensFlexiblePolyhedron/ 
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Figure 7: Steffen’s polyhedron: the flex (these two images were obtained as snapshot from the Mathemat- 
ica |45j demonstration cited in Footnote HH). There is a rotation, in the direction showed by the arrows, 
around the edge which is emphasized on the right picture. The short upper right edge only appears 
shorter on the right because of perspective. 


polynomial in the length of the simplex side lengths that can be equated to zero. If these lengths are 
expressed in function of the vertex positions as Hxu — this yields a polynomial equation in the 

positions xi,..., X 5 of the simplex vertices in terms of its side lengths. Thus, if we know the positions of 
xi, ..., X4, we can compute the unknown position of X5 or prove that no such position exists, through a 
process called trilateration m- 

The proof of (*) was published by Arthur Cayley in 1841 [7], during his undergraduate studies. It is 
based on the following well-known lemma about determinants (stated without proof in Cayley’s paper). 

4.1 Lemma 

If A, B are square matrices having the same size, \AB\ = |A||i3|. 

4.2 Theorem (Cayley [7]1 

Given five points Xi,... ,X 5 G all belonging to an affine 3D subspace of let dij = ||xi — Xj \\2 for 
each i,j < 5. Then 

0 

4i 

dll 
dh 

1 


dl2 

dls 

du 

dls 

1 

0 

dls 

d^ 

“24 

dh 

1 

<^32 

0 

dl. 

dis 

1 

“42 


0 

dls 

1 

dh 

^53 

dli 

0 

1 

1 

1 

1 

1 

0 


We note that Cayley’s theorem is expressed for n = 5 points in but it also holds for n > 3 points 
in [4]. Cayley explicitly remarks that it holds for the cases n = 4 and n = 3 (see [42j VIII, § 5] 

for the proof of general n). The determinant on the right-hand side of Eq. is called Cayley-Menger 
determinant, denoted by A. We remark that in the proof below Xik is the fc-th component of Xi, for each 
z < 5, fc < 4. 

Proof We follow Cayley’s treatment. He pulls the following two matrices 


/ 


-2xii 

-2X12 

-2X13 

-2X14 

1 \ 


( 1 

1 

1 

1 

1 



l|a:2|r 

-2X21 

— 2X22 

-2X23 

-2X24 

1 


Xll 

X21 

X3I 

X41 

X5I 

0 


ll®3|r 

-2X31 

— 2X32 

— 2X33 

— 2X34 

1 

, B = 

X12 

X22 

X32 

X42 

X52 

0 



-2X41 

-2X42 

— 2X43 

— 2X44 

1 

Xl3 

X23 

X33 

X43 

X53 

0 



2X51 

— 2 xs 2 

— 2X53 

2X54 

1 


Xl4 

X24 

X34 

X44 

X54 

0 

V 

1 

0 

0 

0 

0 

0 y 



ll*2f 


ll*4|P 

ll*5f 

1 / 


out of a magic hat. He performs the product AB, re-arranging and collecting terms, and obtains a 6 x 6 
matrix where the last row and column are (1,1,1,1,1, 0), and the {i,j)-ih component is \\xi — Xj||| for 
every i,j < 5. To see this, it suffices to carry out the computations using Mathematica |45) : by way of 
an example, the first diagonal component of AB is ||xi|p — 2 X] ^^ifcXifc -I- ||xi|p = 0, and the component 

fc<4 
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on the first row, second column of AB is ||a;i|p — 2 ^ xikX 2 k + ||a^ 2 |P = ||a^i — X 2 \\^- In other words, 

k<4 

\AB\ is the Cayley-Menger determinant in Eq. (|3]). On the other hand, if we set X 4 k = 0 for each 
fc < 4, effectively projecting the five four-dimensional points in three-dimensional space, it is easy to 
show that |A| = |i?| = 0 since the 5-th columns of both A and the 5th row of B are zero. Hence we have 
0 = |^||i?| = \AB\ by Lemma l4Hl and \AB\ = 0 is precisely Eq. (|4]) as claimed. □ 


The missing link is the relationship of the Cayley-Menger determinant with the volume of an n- 
simplex. Since this is not part of Cayley’s paper, we only establish the relationship for n = 3. Let 
di 2 = a, di 3 = 6, ^23 = c. Then: 


0 1 
0 1 
0 1 

1110 


= — 2a^b^ + b'^ — 2a^c^ — 2b^c^ + 


— 16(s(s — a){s — b){s — c)). 


where s = ^{a+b+c) (this identity can be established by using e.g. Mathematica [15]). By Heron’s theorem 
(Thm. 12.II above! we know that the area of a triangle with side lengths a, 6, c is yj s(s — a){s — b)(s — c). 
So, for n = 3, the determinant on the left-hand side is proportional to the negative of the square of the 
triangle area. This result can be generalized to every value of n 11 n, § 40, p. 98]: it turns out that the 
n-dimensional volume 14, of an n-simplex in R" with side length matrix d = {dij j *, j ^ ^ + 1) is: 

1 /^, a 

“ 2 "( n !)2 


The beauty of Cayley’s proof is in its extreme compactness: it uses determinants to hide all the details 
of elimination theory which would be necessary otherwise. His paper also shows some of these details for 
the simplest case n = 3. The starting equations, as well as the symbolic manipulation steps, depend on n. 
Although Cayley’s proof is only given for n = 5, Cayley’s treatment goes through essentially unchanged 
for any number n of points in dimension n — 2. 


5 Menger’s characterization of abstract metric spaces 


At a time where mathematicians were heeding Hilbert’s call to formalization and axiomatization, Menger 
presented new axioms for geometry based on the notion of distance, and provided conditions for arbitrary 
sets to “look like” Euclidean spaces, at least distancewise [211 ED- Menger’s system allows a formal 
treatment of geometry based on distances as “internal coordinates”. The starting point is to consider 
the relations of geometrical figures having proportional distances between pairs of corresponding points, 
i.e. congruence. Menger’s definition of a congruence system is defined axiomatically, and the resulting 
characterization of abstract distance spaces with respect to subsets of Euclidean spaces (possibly his 
most important result) transforms a possibly infinite verification procedure (any subset of any number of 
points) into a finitistic one (any subset of n -|- 3 points, where n is the dimension of the Euclidean space). 

It is remarkable that almost none of the results below offers an intuitive geometrical grasp, such as the 
proofs of Heron’s formula and Cayley’s theorem do. As formal mathematics has it, part of the beauty in 
Menger’s work consists in turning the “visual” geometrical proofs based on intuition into formal symbolic 
arguments based on sets and relations. On the other hand, Menger himself gave a geometric intuition of 
his results in [221 P- 335], which we comment in Sect. 15.41 below. 
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5.1 Menger’s axioms 

Let 5 be a system of sets, and for any set S € S and any two (not necessarily distinct) points p,q € S, 
denote the couple {p, q) by pq. Menger defines a relation Ri by means of the following axioms. 

1. VS', T € S, Vp, q € S and Vr, s € T, we have either pq tv rs or pq rs but not both. 

2. VS € S and Vp, q € S we have pq tv qp. 

3. VS, T G S, Vp G S and Vr, s G T, we have pp ~ rs if and only if r = s. 

4. VS, T G S, Vp, q G S and Vr, s G T, if pg Ri rs then rs tv pq. 

5. VS, T,U G S, Vp, q G S, Vr, s G T and Vt, u GU ,\I pqtv rs and pg r; tu then rs r; tu. 

The couple (S,Ri) is called a congruence system, and the r; relation is called congruence. 

Today, we are used to think of relations as defined on a single set. We remark that in Menger’s 
treatment, congruence is a binary relation defined on sets of ordered pairs of points, where each point in 
each pair belongs to the same set as the other, yet left-hand and right-hand side terms may belong to 
different sets. We now interpret each axiom from a more contemporary point of view. 

1 . Axiom [1] states that Menger’s congruence relation is in fact a partial relation on = (IJS)^ (the 
Cartesian product of the union of all sets S' G 5 by itself), which is only defined for a couple pq G 
whenever 3S G S such that p,q G S. 

2. By axiom [21 the R relation acts on sets of unordered pairs of (not necessarily distinct) points; we 
call the set of all unordered pairs of points from all sets S G S. 

3. By axiom 121 rs is congruent to a pair pq where p = g if and only if r = s. 

4. Axiom m states that r is a symmetric relation. 

5. Axiom 121 states that r is a transitive relation. 

Note that r is also reflexive (i.e. pq tv pq) since pq tv qp ta pq by two successive applications of Axiom |21 
So, using today’s terminology, R is an equivalence relation defined on a subset of 


5.2 A model for the axioms 

Monger’s model for his axioms is a semi-metric space S, i.e. a set S of points such that to each unordered 
pair {p, g} of points in S we assign a nonnegative real number dpq which we call distance between p and 
g. Under this interpretation. Axiom |21 tells us that dpq = dqp for each pair of points p, g, and Axiom [2] 
tells us that rs is congruent to a single point if and only if drs = 0 , which, together with nonnegativity, 
are the defining properties of semi-metrics (the remaining property, the triangular inequality, tells semi¬ 
metrics apart from metrics). Thus, the set S of all semi-metric spaces together with the relation given 
by pq tv rs dpq = drs is a congruence system. 

5.3 A finitistic characterization of semi-metric spaces 

Two sets S,T G S are congruent if there is a map (called congruence map) (f> : S ^ T, such that 
pq tv (p(p)(l))q) for all p,q G S. We denote this relation by S R^ T, dropping the 4> if it is clear from the 
context. 
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5.1 Lemma 

Any congruence map (j) : S ^ T is injective. 

Proof. Suppose, to get a contradiction, that 3p, q € S with p ^ q and 4>(j)) = 4>{q)'. then pq k. (p{p)(j)(q) = 
4>{p)(j){p) and so, by Axiom [31 p = q against assumption. □ 


If S is congruent to a subset of T, then we say that S is congruently embeddable in T. 


5.3.1 Congruence order 

Now consider a set S € S and an integer n > 0 with the following property: for any T £ 5, if all n-point 
subsets of T are congruent to an n-point subset of S, then T is congruently embeddable in S. If this 
property holds, then S is said to have congruence order n. Formally, the property is written as follows: 

Vre5VT'CT ( (|T'| = 35" C S'(|S"| =nAr'ft! S")) —^ 3RCS{T^R)). (5) 


If |5| < n for some positive integer n, then S can have congruence order n, since the definition is 
vacuously satished. So we assume in the following that |5| > n. 

5.2 Proposition 

If S has congruence order n in S, then it also has congruence order m for each m > n. 


Proof By hypothesis, for every T £ 5, if every n-point subset T' of T is congruent to an n-point subset 
of 5, then there is a subset i? of 5 such that T R. Now any m-point subset of 5 is mapped by ^ to a 
congruent rn-point subset of 5, and again T r; 5, so Eq. m is satisfied for 5 and m. □ 


5.3 Proposition 

(i.e. the Euclidean space which simply consists of the origin) has minimum congruence order 2 in S. 

Proof Pick any T £ 5 with |T| > 1. None of its 2-point subsets is congruent to any 2-point subset of 

since none exists. Moreover, T itself cannot be congruently embedded in IR°, since |T| > 1 = |]R°| and 
no injective congruence map can be defined, against Lemma l5.II So the integer 2 certainly (vacuously) 
satisfies Eq. ([5]) for 5 = M®, which means that K.® has congruence order 2. In view of Prop. 15.21 it also 
has congruence order m for each m > 2. Hence we have to show next that the integer 1 cannot be a 
congruence order for R°. To reach a contradiction, suppose the contract, and let T be as above. By 
Axiom O every singleton subset of T is congruent to a subset of R°, namely the subset containing the 
origin. Thus, by Eq. m, T must be congruent to a subset of R°; but, again, |T| > I = |R°| contradicts 
Lemma [5.11 so T cannot be congruently embedded in which negates Eq. Hence I cannot be a 
congruence order for R°, as claimed. □ 


5.3.2 Monger’s fundamental result 

The fundamental result proved by Menger in 1928 [31] is that the Euclidean space R" has congruence order 
n + 3 but not n-|-2 for each n > 0 in the family 5 of all semi-metric spaces. The important implication of 
Monger’s result is that in order to verify whether an abstract semi-metric space is congruent to a subset 
of a Euclidean space, we only need to verify congruence of each of its n 3- 3 point subsets. 

We follow Blumenthal’s treatment |3], based on the following preliminary definitions and properties, 
which we shall not prove: 
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1. A congruent mapping of a semi-metric space onto itself is called a motion] 

2. n + 1 points in R” are independent if they are not affinely dependent (i.e. if they do not all belong 
to a single hyperplane in R”); 

3. two congruent {n + l)-point subsets of R" are either both independent or both dependent; 

4. there is at most one point of R" with given distances from an independent (n -|- l)-point subset; 

5. any congruence between any two subsets of R” can be extended to a motion; 

6 . any congruence between any two independent (n -I- l)-point subsets of R" can be extended to a 
unique motion. 

5.4 Theorem (Menger |31p 

A non-empty semi-metric space S is congriiently embeddable in R” (but not in any R'’ for r < n) if and 
only if: (a) S contains an (n -f l)-point subset S' which is congruent with an independent (n -|- l)-point 
subset of R"; and (b) each [n -|- 3)-point subset U of S containing S' is congruent to an (n -|- 3)-point 
subset ofR". 

The proof of Monger’s theorem is very formal (see below) and somewhat difficult to follow. It is nonethe¬ 
less a good example of a proof in an axiomatic setting, where logical reasoning is based on syntactical 
transformations induced by inference rules on the given axioms. An intuitive discussion is provided in 
Sect. 15.41 

Proof. (=^) Assume first that S T C R", where the affine closure of T has dimension n. Then T 
must contain an independent subset T' with |T'| = n -|- 1, which we can map back to a subset S' Q S 
using Since are injective, |S"| < \T'\, and by Axiom[2]we have |S"| > \T'\, so |S"| = n -I- 1, 

which establishes (a). Now take any U C S with \U\ = n-\-3 and U D S': this can be mapped via </> to a 
subset IT C T: Lemma IQ ensures injectivity of (p and hence \W\ = n -\-3, establishing (b). 

(<J=) Conversely, assume (a) and (b) hold. By (a), let S' C S with \S'\ = n -|- 1 and S' T' C R”, with 
T' independent and |T'| = n -I- 1. We claim that (p can be extended to a mapping of S into R". Take any 
q € S \ S': by (b). S' U {q} W C R" with |1T| = n-\-2. Note that T' W \ {'p’iq)} by AxiomO 
which implies that for any p € 5", we have uip{p) = ip{p). Moreover, by Property [3] above, W \ {"^Piq)} 
is independent and has cardinality n -I- 1, which by Property [S] above implies that w can be extended to 
a unique motion in R”. So the action of ui is extended to g, and we can define p{q) = u}~^ip(q). We now 
show that this extension of </) is a congruence. Let p,q € S: we aim to prove that pq = <p(j))p{q). Consider 
the set U = S'U{p,q}: since |17| < n-|-3, by (b) there is W C R" with |1T| = \U\ such that U W. As 
above, we note that there is a subset W C W such that |IT'| = n -I- 1 and T' IT', that u}p{r) = 'ip{r) 
for each r G S', and that w is a motion of R". Hence pq = 'tp{p)'tp{q) = (p{p)uj~^(p{q) = p{p)cp{q), as 

claimed. □ 


5.4 An intuitive interpretation 

Although we stated initially that part of the the beauty of the formal treatment of geometry is that 
it is based on symbolic manipulation rather than visual intuition, we quote from a survey paper which 
Menger himself wrote (in Italian, with the help of L. Geymonat) to disseminate the work carried out at 
his seminar |33) . 


Affinche uno spazio metrico reale R sia applicabile a un insieme parziale di R" e necessario e 
sufficiente che per ogni n-\-3 e per ogni n-\-2 punti di esso sia A = 0 e inoltre che ogni n-11 
punti di R siano applicabili a punti di R". 
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The translation is “a real metric space R is embeddable in a subset of K” if and only if A = 0 for each 
(n + 3)- and (n + 2)-point subsets or R, and that each (n + l)-point subset of R is embeddable in R".” 

Since we know that A, the Cayley-Menger determinant of the pairwise distances of a set S of points (see 
Eq. (|4|)), is proportional to the volume of the simplex on S embedded in [S'! — 1 dimensions, what Monger 
is saying is that his result on the congruence order of Euclidean spaces can be intuitively interpreted as 
follows. 


An abstract semi-metric space R is congruently embeddable in R" if and only if: (i) there are 
n+1 points in R which are congruently embeddable in R"; (ii) the volume of the simplex on 
each n + 2 points of R is zero; (iii) the volume of the simplex on each n -|- 3 points of R is 
zero. 


This result is exploited in the algorithm for computing point positions from distances given in [Up. 2284]. 


6 Godel on spherical distances 


Kurt Godel’s name is attached to what is possibly the most revolutionary result in all of mathematics, 
i.e. Godel’s incompleteness theorem, according to which any formal axiomatic system sufficient to encode 
the integers is either inconsistent (it proves A and ->A) or incomplete (there is some true statement A 
which the system cannot prove). This shattered Hilbert’s dream of a formal system in which every true 
mathematical statement could be proved. Few people know that Godel, who attended the Vienna Gircle, 
Menger’s course in geometry, and Menger’s seminar, also contributed two results which are completely 
outside of the domain of logic. These results only appeared in the proceedings of Menger’s seminar |34j . 
and concern DG on a spherical surface. 


6.1 Four points on the surface of a sphere 

The result we discuss here is a proof to the following theorem, conjectured at a previous seminar session 
by Laura Klanfer. We remark that a sphere in R^ is a semi-metric space whenever it is endowed with a 
distance corresponding to the length of a geodesic curve joining two points. 

6.1 Theorem (Godel [20]) 

Given a semi-metric space S of four points, congruently embeddable in R^ but not R^, is also congruently 
embeddable on the surface of a sphere in R^. 

Godel’s proof looks at the circumscribed sphere around a tetrahedron in R^, and analyses the relationship 
of the geodesics, their corresponding chords, and the sphere radius. It then uses a fixed point argument 
to find the radius which corresponds to geodesics which are as long as the given sides. 

Proof. The congruence embedding of S in R^ defines a tetrahedron T having six (straight) sides with 
lengths oi,... , 05 . Let r be the radius of the sphere circumscribed around T (i.e. the smallest sphere 
containing T). We shall now consider a family of tetrahedra t{x), parametrized on a scalar x > 0, defined 
as follows: t{x) is the tetrahedron in R^ having side lengths Cx{ai),..., Cx{ae), where Cx(a) is the length 
of the chord subtending a geodesic having length a on a sphere of radius i. As x tends towards zero, each 
Cx(o.i) tends towards at (for each f < 6 ), since the radius of the sphere tends towards infinity and each 
geodesic length tends towards the length of the subtending chord. This means that t(x) tends towards 
T, since T is precisely the tetrahedron having side lengths ai,..., 05 . For each x > 0, let (j){x) be the 
inverse of the radius of the sphere circumscribed about t(x). Since r(x) —?► T as x —0, and the radius 
circumscribed about T is r, it follows that (j){x) —^ as x —>■ 0. Also, since T exists by hypothesis, we can 
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define t(0) = T and ^(0) 


Also note that it is well known by elementary spherical geometry that: 


Cxia) 



( 6 ) 


Claim : if a' = maxjai,..., oe} then (j) has a fixed point in the open interval I = (0, ^). 

Proof of the claim. First of all notice that r(0) exists, and Cx{a) is a continuous function for x > 0 for 
each a (by Eq. ([6])). Since r(x) is defined by the chord lengths Cx{ai), ..., Cx{ae), this also means that 
t(x) varies continuously for x in some open interval J = (0,e) (for some constant e > 0). In turn, this 
implies that x = max{j/ € I \ T{y) exists} exists by continuity. There are two cases: either x is at the 
upper extremum of /, or it is not. 


(i) If X = then t(x) exists, its longest edge has length Cx{a') = 2^, so, by elementary spherical 
geometry, the radius of the sphere circumscribed around r(x) is greater than ^ , i.e. greater 
than ^ = i. Thus (j){x) < x. We also have, however, that ^(0) = ^ > 0, so by the intermediate 
value theorem there must be some x € (0, x) with (j){x) = x. 

(ii) Assume now x < A- and suppose t(x) is non-planar. Then for each y in an arbitrary small 
neighbourhood around x, T{y) must exist by continuity: in particular, there must be some y > x 
where t(j/) exists, which contradicts the definition of x. So t(x) is planar: this means that each 
geodesic is contained in the same plane, which implies that the geodesics are linear segments. It 
follows that the circumscribed sphere has infinite radius, or, equivalently, that ^(x) = 0 < x. Again, 
by (/)(0) > 0 and the intermediate value theorem, there must be some x € (0, x) with <f>{x) = x. 


This concludes the proof of the claim. 

So now let y be the fixed point of (f. The tetrahedron t(j/) has side lengths Cy(ai) for each i < 6 , and is 
circumscribed by a sphere cr with radius It follows that, on the sphere ct, the geodesics corresponding 
to the chords given by the tetrahedron sides have lengths Oi (for i < 6 ), as claimed. □ 


6.2 Godel’s devilish genius 

Godel’s proof exhibits an unusual peak of devilish genius. At first sight, it is a one-dimensional fixed-point 
argument which employs a couple of elementary notions in spherical geometry. Underneath the surface, 
the fixed-point argument eschews a misleading visual intuition. 

T is a given tetrahedron in which is assumed to be non-planar and circumscribed by a sphere of 
finite positive radius r (see Fig. [SJ left). The map r sends a scalar x to the tetrahedron having as side 
lengths the chords subtending the geodesics of length Oi {i < 6 ) on a sphere of radius ^ (see Fig. [SJ 
right). The map r is such that r(0) = T since for x = 0 the radius is infinite, which means that the 
geodesics are equal to their chords. Moreover, the map (f sends x to the inverse of the radius of the 
sphere circumscribing t(x). Since every geodesic on the sphere is a portion of a great circle, it would 
appear from Fig. |S] (right) that the radius i used to compute Cx{ai) {i < 6 ) is the same as the radius 
of the sphere circumscribing t(x), which would immediately yield ^(x) = x for every x — making 
the proof trivial. There is something inconsistent, however, in the visual interpretation of Fig. [ 8 j the 
given tetrahedron T corresponds to the case t(x) = T, which happens when x = 0, i.e. the radius of 
the sphere circumscribed around T is oo. But this would yield T to be a planar tetrahedron, which is 
a contradiction with an assumption of the theorem. Moreover, if 4>{x) were equal to x for each x, this 
would yield 0 = (/)( 0 ) = i > 0 , another contradiction. 

The misleading concept is hidden in the picture in Fig. [5] (right). It shows a tetrahedron inscribed in 
a sphere, and a spherical tetrahedron on the same vertices. This is not true in general, i.e. the spherical 
tetrahedron with the given curved side lengths oi,..., as cannot, in general, be embedded in the surface of 
a sphere of any radius. For example, the case x = 0 yields geodesics with infinite curvatures (i.e. straight 
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Figure 8: The given tetrahedron T (left), and the tetrahedron t{x) (right). Beware of this visual inter¬ 
pretation: it may yield misleading insights (see Sect. IQ) . 


lines laying in a plane), but 4>{x) = ^ > 0, and there is no flat tetrahedron with the same distances as 
those of T. The sense of Gddel’s proof is that the function Cx simply transforms a set of geodesic distances 
into a set of linear distances, i.e. it maps scalars to scalars rather than geodesics to segments, whereas 
Fig. |S] (right) shows the special case where the geodesics are mapped to the corresponding segments, 
with intersections at the same points (namely the distances oi,..., oe can be embedded on the particular 
sphere shown in the picture). More specifically, the geodesic curves may or may not be realizable on a 
sphere of radius Godel’s proof shows exactly that there must be some x for which (^(cc) = cc, i.e. the 
geodesic curves become realizable. 


6.3 Existential vs. constructive proofs 

Like many existential proofs based on fixed-point theorems^ this proof is beautiful because it asserts 
the truth of the theorem without any certificates other than its own logical validity. An alternative, 
constructive proof of Thm. [01 is given in [311 Thm. 3’]. The tools used in that proof, Gayley-Monger 
determinants and positive semidefiniteness, are discussed in Sect. [7] below. 


7 The equivalence of EDM and PSD matrices 


Many fundamental innovations stem from what are essentially footnotes to apparently deeper or more 
important work. Isaac Schoenberg, better known as the inventor of splines SO], published a paper in 
1935 titled Remarks to Maurice Frechet’s article “Sur la definition axiomatique d’une classe d’espace 
distancies vectoriellement applicable sur I’espace de Hilbert” [39j . The impact of Schoenberg’s remarks 
far exceeds that of the original papc]0: these remarks encode what amounts to the basis of the well- 

^^Interestingly, Godel’s famous incompleteness theorem is also a fixed-point argument (in a much more complicated set), 
not altogether dissimilar situation arose for the Johnson-Lindenstrauss (JL) lemma m- the paper is concerned with 
extending a mapping from n-point subsets of a metric space to the whole metric space in such a way that the Lipschitz 
constant of the extension is bounded by at most a constant factor. Johnson and Lindenstrauss state on page 1 that “The 
main tool for proving Theorem 1 is a simply stated elementary geometric lemma”. This lemma is now known as the JL 
lemma, and postulates the existence of low-distortion projection matrices which map to Euclidean spaces of logarithmically 
fewer dimensions. The impact of the lemma far exceeds that of the main result. 
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known MDS techniques for visualizing high-dimensional data m, as well as all the solution techniques 
for Distance Geometry Problems (DGP) based on Semidefinite Programming (SDP) [29l 1^. 

7.1 Schoenberg’s problem 

Schoenberg poses the following problem, relevant to Monger’s treatment of distance geometry [551 p. 737]. 

Given an n x n symmetric matrix D, what are necessary and sufficient conditions such that 
D is a EDM corresponding to n points in K’’, with 1 < r < n minimum? 

Monger’s solution is based on Cayley-Menger determinants; Schoenberg’s solution is much simpler and 
more elegant, and rests upon the following theorem. Recall that a matrix is PSD if and only if all its 
eigenvalues are nonnegative. 

7.1 Theorem (Schoenberg |39| 1 

The n X n symmetric matrix D = (dij) is the EDM of a set of n points x = {xi,..., Xn} C K’’ (with r 
minimum) if and only if the matrix G = -I- — dfj \ “2 < i, j < n) is PSD of rank r. 

Instead of providing Schoenberg’s proof, we follow a more modern treatment, which also unearths the 
important link of this theorem with classical MDS nni § 2.2.1], an approximate method for finding sets 
of points X = {xi, ... ,Xn} having EDM which approximates a given symmetric matrix. MDS is one of 
the cornerstones of the modern science of data analysis. 

7.2 Tbe proof of Schoenberg’s theorem 

Given a set x = {xi,..., x„} of points in R’’, we can write x as an r x n matrix having Xi as i-th column. 
The matrix G = xx^ having the scalar product XiXj as its component is called the Gram matrix 

or Gramian of x. The proof of Thm. 17.11 works by exhibiting a 1-1 correspondence between squared 
EDMs and Gram matrices, and then by proving that a matrix is Gram if and only if it is PSD. 

Without loss of generality, we can assume that the barycenter of the points in x is at the origin: 

^x,=0. (7) 

i<n 

Now we remark that, for each i,j < n, we have: 

= Ik* “ ~ - ^j) = - 2 XiXj. (8) 

7.2.1 The Gram matrix in function of the EDM 

We “invert” Eq. ([H]) to compute the matrix G = xx"'" = (xiXj) in function of the matrix D^ = (df^). We 
sum Eq. m over all values oi i G {l,...,n}, obtaining: 

'^Xi\xj. (9) 

2<n j 

By Eq. ©, the negative term in the right hand side of Eq. ® is zero. On dividing through by n, we 
have ^ ^ 

i<.n i<.n 


X! + n{xjXj) - 2 


i<.n 


i<.n 


( 10 ) 
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Similarly for j € {1,..., n}, we obtain: 


i ^ 4 = i J2^XjXj). 

j<n 


We now sum Eq. m over all j, gettiirg: 


1 

n 




'Y^ix^Xi) + 

i<n j'^'n i<n 


( 11 ) 


( 12 ) 


(the last equality hr Eq. (TT^ holds because the same quantity f(k) = XkXk is being summed over the 
same range {1,... ,n}, with the symbol k replaced by the symbol i first and j next). We then divide 
through by n to get: 





(13) 


We now rearrange Eq. ( 0 ), (HU), (nni) as follows: 


2 xiXj 

= XiXi + XjXj — djj 

(14) 

XiXi 

j<n 

(15) 

XjXj 

= -X!4 “ 

i<.n i<.n 

(16) 


and replace the left hand side terms of Eq. (I15l) - (ll6p into Eq. (HI to obtain: 


2x,xj = ^Y^^^k + ^Y - 4 -1 


k<r. 


k<r. 


k<r 


whence, on substituting the last term using Eq. dni), we have: 


i X!(4fc + dlj) - 4 

k<n h<n 

— k<n 


(17) 


(18) 


It turns out that Eq. o can be written in matrix form as: 

G = -ij7^V, 

where J = /„ — and 1 = (1,..., 1 ). 

n 


(19) 


7.2.2 Gram matrices are PSD matrices 

Any Gram matrix G = xx^ derived by a point sequence (also called a realization) x = (xi,... ,Xn) in 
for some non-negative integer K has two important properties: (i) the rank of G is equal to the rank 
of X] and (ii) G is PSD, i.e. iTGy > 0 for all y G M”. For simplicity, we only prove these properties in 
the case when x = (xi,..., Xn) is a 1 x n matrix, i.e. x € M", and Xi is a scalar for all i < n (this is the 
case r = 1 in Schoenberg’s problem above). 

(i) The i-th column of G is the vector x multiplied by the scalar Xi, which means that every column 
of G is a scalar multiple of a single column vector, and hence that rkG = 1; 
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(ii) For any vector y, Gy = y'^{xx^)y = {y"^x){x^y) = (x^yY > 0. 

Moreover, G is a Gram matrix only i/it is PSD. Let M be a PSD matrix. By spectral decomposition 
there is a unitary matrix Y such that M = YKY^ ^ where A is diagonal. By positive semidefiniteness, 
Kii > 0 for each i, so v^YdV exists. Hence M = y/YA^y/YA) , which makes M the Gram matrix of the 
vector y/YA. This concludes the proof of Thm. 17.11 

7.3 Finding the realization of a Gramian 

Having computed the Gram matrix G from the EDM D in Sect. 17.21 we obtain the corresponding 
realization x as follows. This is essentially the same reasoning used above to show the equivalence of 
Gramians and PSD matrices, but we give a few more details. 

Let A = diag(Ai,..., A,.) be the r x r matrix with the eigenvalues Ai > ... > A^ along the diagonal 
and zeroes everywhere else, and let Y be the n x r matrix having the eigenvector corresponding to the 
eigenvalue Xj as its j-th column (for j < r), chosen so that Y consists of orthogonal columns. Then 
G = YAY^. Since A is a diagonal matrix and all its diagonal entries are nonnegative (by positive 
semidefiniteness of G), we can write A as where = diag(i/Xi', ■.., Now, since G = xx^, 

xx^ = (X'/A){y/AY^), 

which implies that 

X = y VA (20) 

is a realization of G in R’’. 


7.4 Multidimensional Scaling 

MDS can be used to find realizations of approximate distance matrices D. As above, we compute 
G = Since D is not a EDM, G will probably fail to be a Gram matrix, and as such might 

have negative eigenvalues. But it suffices to let Y be the eigenvectors corresponding to the H positive 
eigenvalues Ai,..., Xh, to recover an approximate realization x oi D in . 

Another interesting feature of MDS is that the dimensionality H of the ambient space of x is actually 
determined by D (or D) rather than given as a problem input. In other words, MDS finds the “inherent 
dimensionality” of a set of (approximate) pairwise distances. 


8 Conclusion 


We presented what we feel are the most important and/or beautiful theorems in DG (Heron’s, Cauchy’s, 
Cayley’s, Monger’s, Godel’s and Schoenberg’s). Three of them (Heron’s, Cayley’s, Monger’s) have to 
do with the volume of simplices given its side lengths, which appears to be the central concept in DG. 
We think Cauchy’s proof is as beautiful as a piece of classical art, whereas Godel’s proof, though less 
important, is stunning. Last but not least, Schoenberg’s theorem is the fundamental link between the 
history of DG and its contemporary treatment. 
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